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A rule to assign physical meaning to Lagrange multipliers is discussed. Examples 
from mechanics, statistical mechanics and quantum mechanics are given. 

; I. INTRODUCTION 

(D 

• ^ I Lagrange multipliers arise frequently in physics, engineering, economics and mathematics 

^ ' in optimization problems with constraints. In statistical mechanics it arises in microcanon- 

ical derivation of distribution laws of quantum gases and in some other problems where 

>■ ■ entropy or free energy is maximized or minimized under the constrains of fixed total parti- 

^ ■ cle number and/or energy. In classical mechanics Lagrange multipliers arise in Lagrangian 
p 

o 
o 



formulation of mechanics with constraints where multipliers are interpreted as constraint 
forces. In economics and engineering there are plenty of problems that requires optimiza- 
tion of some quantity under some constraints and Lagrange multipliers arise naturally in 



^ ■ these fields. 

In most of these cases Lagrange multiplier turns out to have a physical meaning at the 
end of calculation. In economics people seem to have long been aware that the multipliers 
are related to rate of change of optimized quantity with respect to some parameter in 
the calculation. In physics it is well known that multipliers usually have precise physical 
meanings although there is not a general rule to identify their physical meaning directly. 
In statistical mechanics Lagrange multipliers occurs in many places and the pedagogical 
articles on them seem to concentrate on how to teach statistical mechanics without using 



Multipliers 

mm 



Lagrange Multipliers [l|, |2|, y, Ij, |5j rather than how to find precise physical interpretation 
of the multipliers. Statistical mechanics textbooks derive physical meaning of multipliers 
by using supplementary arguments. I have long felt that (at least in statistical mechanics) 
there must be a direct rule to identify the physical meaning of multipliers and after some 
research I found that such a rule exist and It works well if the constrains have a well defined 
physical meaning. It does not seem to work for quantum applications in particular where 
quantities such as overlap integrals between orbitals do not have a clear physical meaning. 
But in some areas of physics and engineering as well as in economics the rule works well. 

I have come across this rule in economics textbooks and economists have been aware of it 
for long time. Even the books on mathematical economics written in the sixties and seventies 
have the ruleQ] and there are articles(see Baxley and Moohousej^ for example) discussing 
the Lagrange multipliers in economics. Engineers too are interested in Lagrange multipliers 

n 

and Bertsekas's book[8] on Lagrange multipliers has the above mentioned rule. Traditionally 
Lagrange multipliers method is introduced in calculus books and they do not discuss physical 
meaning of multipliers. Even this is changing and two of recent calculus booksl9|, 110| discuss 

n 

the meaning of multipliers and present the rule without proof(Actually [ICi] provides some 
justification). The Lagrange multipliers are used in constrained variational problems and a 
proper extension of the rule exists in the literature on calculus of variations, jllj] 

I have checked the statistical mechanics books that I was able to reach and none of them 
mentions the rule and I have not come across it elsewhere in the physics literature that I 
am familiar with. It seems that the rule is not well known in physics community. This 
prompted me to write this paper and discuss use of this rule in physics. In the paper I will 
show how the rule makes interpretation of multipliers easy in statistical mechanics and I 
will also present a quantum application where the rule is not applicable. I also provide some 
examples that shows how the rule makes interpretation of Lagrange multipliers easy in some 
mechanics problems. 

In the following section I derive the above mentioned rule for discrete variables and I 
discuss two examples. In the third section I derive the same rule for constrained variational 
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problems and I discuss three examples. 



II. DERIVATION OF THE RULE FOR DISCRETE VARIABLES AND 

EXAMPLES 

First we start by reminding the reader how Lagrange multipliers used in constrained 
optimization. Suppose we want to find a local maximum or minimum of a function 
f{xiX2, ■■■,X]y) under the constraints 

gi{xiX2,...,XN) = Ui, (i = 1,2, ...,M). (1) 

Lets denote the maximum/minimum point with starred characters {xlx2, ■■■,x*^) and lets 
use the compact vector notation = {xiX2, ■■■,X]\f) and T^* = (x^ X2, ...,x*j^). We also use 
N-dimensional gradient 

V/ = -^ei + -^62 + ... + -TT^eN, 2 
0x1 0x2 OXn 

where 61,62. ..cat are the unit vectors in the directions X1X2, ...,xn- With this notation, at 
the extremum point we must have 

M 

^f{T'*)=Y.K^9^{^*), (3) 

1=1 

where Ai, A2, Xm are the Lagrange multipliers. This represents sets of equations. We 
also have constraints in eq. ([T]) which are M sets of equations. We have a total of + M 
equations and N + M unknowns {xl X2, x*^) and Ai, A2, Xm- and therefore they can be 
solved in principle. Sometimes this sets of equations are written as 

d^/dxn = (n = l,2,...,iV), (4) 
d^/dX^ = (m = l,2,...,M). (5) 

where $ is the auxiliary function 

M 

<l>(r, Ai, Am) = /(r) - ^ Kig^i^) - u,). (6) 

i=l 



4 



Now we are ready to derive the rule for meaning of the Lagrange multiphers. The 
coordinate of the extremum point "r** will be a function of Ui,U2, ■■■,um- Therefore the 
value of /(^) at the extremum point "r^* will be a function of ui,U2, ■■■,um too. We denote 
this function as F{ui,U2, um) '■ 

F(ui,U2, ...,MAf) = /(^*)- (7) 

Now, suppose each Ui changed by an infinitesimal amount: Ui ^ Ui + dui. This changes the 
extremum point 7^* by dPf*: '•f* — > "r^* + dPf*. The dui and dl^* are connected: 

dUi = QiiT"* + d't*) - Qii't*) = "^giiy*) ■ d'r'*. (8) 

The change in F is 

dF = f{l^* + dl^*) - f{l^*) = '^/(^*) ■ dl^*. (9) 
The ^ fi'r^*) can be written from eq.Q as 

M 

dF = J2X,-Vg,ir*)-dr*, (10) 

i=l 

dF = \idui + \2du2 + ... + XuduM- (11) 

This clearly tells us that 

A, = dF{m, «2, UM)/dui {i = 1,2,..., M) (12) 

Usually both F{ui,U2, um) and Ui, U2, ...,um have well defined physical meaning and this 
relation assigns a physical meaning to the Lagrange multipliers. 



A. Examplel. Maximum range of a projectile 

It is instructive to do this elementary exercise using Lagrange multiplier. If the compo- 
nents of initial velocity are Vx, Vy then the flight time of the projectile is 2Vy/g and the range 
is RiVx, Vy) = 2VxVy/g. The kinetic energy of the projectile is fixed: miV^ + Vy)/2 = E. 
The auxiliary function $(V^, V^, A) is 

$(K., Vy, A) = 2VxVylg - A [m{V^ + if )/2 - e] . (13) 
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Taking partial derivatives with respect to Vx,Vy and A we get the set of equations 



2Vy/g — \mV^ 



0, 



(14) 



2Vjg - \mVy 



(15) 



^(v;^ + )/2 



E. 



(16) 



The solutions are 



v: = v; = ^eJ^, 



(17) 



A = 2/mg. 



(18) 



The maximum range is 



R{y:,V;) = {2/mg)E, 



(19) 



and A = dR{V* ,V*)/dE is satisfied. The meaning of A here is the rate of increase of 
maximum range with energy. 

B. Example2. Microcanonical derivation of quantum gas distributions 

Most applications of Lagrange multipliers involve only one multiplier and some involve 
two multipliers. In physics applications involving more than two multipliers are extremely 
rare. Here we present a common application in statistical mechanics involving two multipli- 
ers. It is the uninteracting quantum (Both Bose and fermi) gases. 

We follow Huang's statistical mechanics textbookpjj] in the following discussion. We 
first divide the energy into small intervals AEi. In every interval AEi we have gi single 
particle levels and rii particles occupying these levels. The system size (particle number and 
volume) is huge such both the gi and rii are large numbers. But the energy intervals AEi 
are small enough such that we can assume that all g^ have the same energy Ei where Ei is 
some average energy value within the AEi interval (say the midpoint of the interval). By 
taking a larger and larger system we can make these assumption more and more accurate. 
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The number of different ways of putting rij particles into gi degenerate single particle 
states for Bose particles is 



and for fermi particles is 



{rij + gi - 1)! 



W^ = (21) 

ni\{ni - gi)! 



The total number of ways can be written as 

W{n,} = l[w,. (22) 

i 

We want to maximize subject to the constraints 

J2n, = N, (23) 

i 

Y^n^Ei = U. (24) 

i 

Instead of maximizing lyjrij} we maximize kB\Ta.{W{ni}) for convenience. We write the 
auxiliary function A) 



$({n,}, A) = E Hwi) - Ai (^Y. ^^^^ (y - ^) 



(25) 



and use the Stirling approximation to express the factorials in Inwj and set derivatives 
d^/dui to zero to obtain the equations that {ui} satisfy. When they are solved we obtain 

n* = ^ (26) 

' exp{X,E,/kB + X2/kB)±l' ^ ' 

where + is for fermions and — is for bosons in the denominator. Here gi » 1 is used 
to obtain this result. The Ai and A2 is found by solving the equations ^n* = N and 

i 

J^^iEi = U simultaneously. 

i 

Now we have the problem of identifying the thermodynamical meaning of Ai and A2. 
Staying within the microcanonical ensemble this identification requires considerable amount 
of supplementary argument. Of course one can derive eq. (12^ using grand canonical ensem- 
ble where the quantities appearing in place of Ai and A2 have fixed meanings in the grand 
ensemble. But in teaching statistical mechanics we usually want to show that all ensem- 
bles are equivalent an we should be able to find physical meaning of Ai and A2 staying in 
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microcanonical ensemble too. Our rule makes this identification a trivial exercise as shown 
below. 

In the microcanonical ensemble the entropy is 

S = kBln(j2VV{ni}] , (27) 
\K} / 

where the sum runs over all possible {rii} sets. It is shown in statistical mechanics textbooks 
that in the thermodynamic hmit this is equal to /cBlnl^{n*}. Therefore A;Blnl^{n*}is the 
entropy S{U, N) as a function of internal energy U and the particle number N. Prom the 
rule it follows that the Ai and A2 are 

Ai = {dS/dU)^ (28) 
A2 = (dS/dN)u. (29) 

The first one is easily interpreted as 1/T where T is the temperature of the system. In order 
to interpret the second one we use the well-known relation 

{dS/dN)u ■ {dN/dU)s ■ {dU/dS)^ = -1. (30) 

Prom thermodynamics {dU/dS)^ = T and {dN/dU)s = /i"^ where ji is the chemical 
potential of the system. Then we obtain A2 = {dS/dN)u = —/i/T. 

As can be seen from this example, in statistical mechanical applications the rule makes 
supplementary arguments to interpret multipliers unnecessary. I invite the reader to try 
to find another argument as short as this one (staying within microcanonical ensemble) 
without using the rule. In statistical mechanics (unlike quantum mechanics for example) 
both constrained and optimized quantities always have well defined physical meanings and 
identification of thermodynamical meaning of the multipliers is only one- line argument using 
this rule. Once the rule is introduced beforehand (in an appendix of a book or as a handout in 
a class for example) identification of multipliers will be a very precise and concise argument. 
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III. DERIVATION OF THE RULE IN VARIATIONAL PROBLEMS AND 

EXAMPLES 

We will not discuss the theory of variational calculus here or how the Lagrange multipliers 
are used in variational calculus. We will merely state basic results and derive the equivalent 
formula for the interpretation of Lagrange multipliers. 

In the simplest case of variational problems we are trying to find the function y{x) that 
makes the integral 

6 

^[y] = J fix,y,y')dx, (31) 

a 

an extremum where y' = dy/dx and y{a) and y{h) are fixed. The function y* that makes 
I[y\ an extremum satisfies the Euler differential equation 

^ = ^-Af^Uo. (32) 

5y* dy* dx \d{y*)' ) 

If we have constraints 

JM = ]9^{x,y,y')dx = U, = 1, 2, M), (33) 

a 

then we solve the differential equation 

^^EA.^ (34) 
5y* t[ 

for y*{x) which depends on Lagrange multipliers Ai, Aa/ parametrically. When we put y* 
in the constraint equations eq. (!33|) we obtain a set of algebraic (in general nonlinear) sets 
of equation for Ai, Am- The solutions for Ai, Xm will be a function of the parameters 
Ui,U2, ■■■,Um and when the multipliers are put in y*{x) back, the y*{x) itself will be a 
function of Ui,U2, ■■■,Um parameters. When we feed y*{x)m the integral for I[y] we get 
I[y*] which will be a function of Ui, U2, Um too. We denote this function as I(U) where 
U stands for Ui, U2, Um- So we have I(U) = I[y*]. 

Now we change the parameters Ui, U2, Um by an infinitesimal amount: f/, — f/j + dUi 
and this changes the solution y* by an infinitesimal amount: y* ^ y* + Ay* where Ay* is 



9 



an infinitesimal function, tliis cfianges I(U) by 

rfl(U) = I[y* + Ay*] - I[y*] = j ^ (Ay*) dx. (35) 
For tlie constraint we also have 

dU., = My* + Ay*] - ^y*] = J {6My*]/6y) (Ay*) dx. = 1, 2, M). (36) 

a 

Putting the 6I[y*]/6y from eq. flM|) into the eq. fl35|) and comparing to the eq. fl36l) we obtain 

dI(U) = Aidf/i + X2dU2 + XndUM, (37) 

which tells us that 

Xi = dl{Ui, U2, UM)/dU, (z = 1, 2, M). (38) 

Usually both I(U) and U have physical meaning and this prescription assigns a physical 
meaning to the Lagrange multipliers Ai, Am- 

A. Examplel. Hanging chain 

In mechanics Lagrange multipliers are used in many places. One kind of application 



involves constrained motion in Lagrangian formulation of mechanics. [131] In this application 
Lagrange multipliers are time dependent and it is well known that they are equal to con- 
straint forces. The author does not see how our rule applies to this case if it does. But it is 
possible to find mechanics problems where the rule applicable. Here we give an example to 
this. 

Suppose a chain of fixed length L and density p is hanging from two points (±a, 0) in the 
X — y plane. The shape of the chain y{x) will be such that the potential energy integral 

V[y] = pg r y^l + Wdx (39) 

J —a 

will be a minimum under the constraint that the length of the chain is fixed: 

r + {y'fdx = L (40) 

J —a 
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Setting up the Euler equation with a Lagrange multipher is straightforward. Solving the 
arising differential equation and imposing the boundary conditions |/(±a) = we obtain the 
solution 

y*{x) = — [cosh(ax) — cosh(aa)] , (41) 

A = — — cosh(aa), (42) 
a 

where a is an integration constant. To obtain a relation between a and L we put y*{x) in 
the eq. pOj) to obtain 

2 

— sinh(aa) = L. (43) 
a 

We also calculate V[y*] by putting the y* in the eq. flS^ to obtain 

..r *n fd sinh(2Q;a)\ ,,,, 
V[y*] =pg[- ^ . 44 

To show that dV[y*]/dL = A we write dV[y*]/dL as 

dV[y*] dV[y*]/da 



dL dL/da ' 

and calculate dL/da and dV[y*]/da from eqs. (I43II44I) to obtain 



(45) 



— j^—^ = cosh(Q;a) = A. (46) 

dL a 

We showed that dV[y*]/dL = A holds. 

Now what is the physical meaning of the Lagrange multiplier A? Imagine pulling the 
chain from one end by an infinitesimal amount dL. The work done is T ■ dL where T is 
the tension at the endpoints and this should be equal to the potential energy difference 
V[y*{L - dL)] - V[y*{L)] = -dV[y*]. Therefore T = -dV[y*]/dL = -A and we have the 
physical interpretation that —A is equal to the tension at the ends of the chain. As usually 
the case in mechanics, here too the multiplier turns out to be a constraint force. 

Here application of the rule dV[y*]/dL = A makes the interpretation of multiplier a simple 
one- line argument. I invite the reader to try to find another argument as short as this one 
for the physical meaning of the multiplier without using the rule. 
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B. Example2. Constrained brachistochrone problem 

Here we present a mechanics problem that the multipher is not a constraint force and 
yet has a very precise meaning. We have a rail that extends from point A (x — 0,y = h) to 
the point B {x — a,y — 0) and the rail has a fixed length L > y/a?~+li^. A mass slides on 
it from point A to point B. What is the curve that delivers the shortest travel time? 

Here the travel time 

[T l l + {dy/dx)\ 
^^y^ = iYgL V h-y 

is minimized with the constraint 

U[y] = £ ^1 + {dy/dxfdx = L. (48) 

The Euler equation 

^ = A^. (49) 

6y 5y 

yields (after integrating once) the differential equation 

CJl + {dy/dxf = , ^ - X (50) 

where C is an integration constant. Because of difficulties with evaluating some integrals 
we were not able to solve this problem in closed form. Therefore we do not give a explicit 
demonstration that the rule A = dT[y*]/dL holds. But assuming that the multipher is 
calculated (numerically or using some approximation) we can tell physical meaning of it. It 
is rate of increase of optimal travel time with increasing length L. Physically we expect that 
as L starts increasing from the initial value VaF+T^ the travel time T[y*] should decrease 
(hence negative A) until an optimal length Lq is reached. When L — Lq we have A — and 
problem reduces to unconstrained brachistochrone problem and travel time is the absolute 
minimum. As L is increased further the travel time should increasing again yielding positive 
A. Here we see that, using both our knowledge of physics and the rule enables us to find the 
sign of A without solving the problem. 
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C. Examples. Minimum uncertainty wave packet 

In some quantum problems the Lagrange multipliers have no apparent physical meanings. 
These cases arises especially for normalization and overlap integral constraints. Here we give 
an example that has two Lagrange multipliers. One of the Lagrange multipliers has a clear 
physical meaning obeys the rule and the other does not. 

We want to find the wave packet with the minimum momentum uncertainty 

oo / oo \ 2 

M[^] = J iP*P^iljdx - I J il^*P^dx J = al (51) 



-oo 



with the conditions that position uncertainty is fixed 

oo / oo \ 2 

K[^l)]= j ^*x^^dx- { j ^*x^dx\ =al, (52) 

— oo Voo / 

and the wave function is normalized 

oo 

Nli/j] = J iP*il;dx = u. (53) 

— oo 

Here P — —ihd/dx momentum operator and for the sake of the argument we consider 
normalization integral with a general fixed value u. We later take u = 1 after showing that 
u ^ 1 has no solution. Treating ip and ip* as independent variables the Euler equation with 
two Lagrange multipliers 

Mf[^_ (54) 

dijj* Sijj* dijj* 

yields the equation 

- h'^ij"{x) + 2ihbip'{x) - Ai (x^ - 2ax) ^{x) - X2ip{x) = (55) 
where a and b are the parameters defined as 

oo 

a — J 'ip{xyx^{x)dx, (56) 

— oo 
oo 

6 = y ijj*(x)P'ijj(x)dx. (57) 
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As a consistency condition the solution of eg. (155!) should satisfy eqs.( !56|57|) . Making the 
transformation ijj(x) = e^^^^^Lp{x) we obtain the equation 

- h^^'\x) - Ai (x - af Lp{x) = (A2 + 6^ - Xia^) f. (58) 

This is the Schrodinger equation for a Harmonic oscillator problem (m = 1/2 and w = 



a/— 4Ai) and normalizable solutions are possible only for Ai < 0. The solutions are the 
harmonic oscillator eigenfunctions 00, 0i, ...,</>„, ... where 0„(a;)is normalized in the usual 
way: J \(f)n{x)f dx = 1. But the ip{x) must satisfy the normalization condition given in 
eq. (!53l) . Therefore we take ^n{x) = A/ue*''^/^</)„(x). With this -ipix) we get the following 
expectation values 

00 

ipn{x)x'^n{x)dx = ua, (59) 
ip*^{x)Pipn{x)dx = uh. (60) 



-00 
00 



The solution does not satisfy the consistency conditions given in eqs. (15611571) . Therefore 
consistent solutions are possible only for u = 1. We take u = 1 for the rest of the discussion. 

All ipnix) are stationary solutions and they all have (P) = h. For minimum momentum 
uncertainty we must have minimum (P^). The {ipn ipn) is easily calculated as 6^ + (ra + 
l/2)hw/2. Therefore for minimum momentum uncertainty we must take the ground state: 



1/4 



0(a;)= — e-'^^^-")', (61) 



TT 



where (x) = a. Putting this back into the eq. fl58|l yields 



Ai = -APh^ (62) 
2kh^ = A2 + 6^-Aial (63) 



We also calculate (x^) and (P^) for iPq{ 



{x^) = + l/{Ak) (64) 
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Prom this = (x^) - (x)^ = 1/(4A;) and (al)^^ = (P^) - (P)^ = ^^A; are obtained. This 
yields the uncertainty principle (o'p)minO"x — ^^/4. The Lagrange multiplier Ai has the value 
Ai = — ^^/4(cr^)^ and satisfies the rule 

min ^ ('^p)min 

T/ie physical meaning of the lagrange multiplier is the rate of change of minimum momentum 
uncertainty with position uncertainty which is equal to negative ratio of both uncertainties. 

The Lagrange multiplier A2 has the value 2kh^ — b'^ + Aia^ and it has no obvious physical 
interpretation. We cannot apply the rule A2 = d{ap)min/ du either because we showed that 
the solution does not exist for w 7^ 1. Even if it existed it would still be difficult to attach a 
meaning to it because the for ti 7^ 1 we have no physical meaning to assign to the variable 
u. 
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